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Abstract 

Logarithmic gaps have been used in order to find a periodic com- 
ponent of the sequence of prime numbers, hidden by a random noise 
(stochastic or chaotic). The recovered period for the sequence of the 
first 10000 prime numbers is equal to 8 it 1 (subject to the prime num- 
ber theorem). For small and moderate values of the prime numbers 
(first 2000 prime numbers) this result has been directly checked using 
the twin prime killing method. 



"There are some mysteries that 
the human mind will never penetrate. 
To convince ourselves we have only 
to case a glance at the tables of 
primes and we should perceive that 
there reigns neither order nor rule." 

L. Euler 



1 Introduction 



The prime number distribution is apparently random. The apparent ran- 
domness can be stochastic or chaotic (deterministic). It is well known that 
the Riemann zeros obey the chaotic GUE statistics (if the primes are inter- 
preted as the classical periodic orbits of a chaotic system, see for a recent 
review [1]), whereas the primes themselves are believed to be stochastically 
distributed (Poissonian-like etc. |2j). However, recent investigations suggest 
that primes themselves "...could be eigenvalues of a quantum system whose 
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classical counterpart is chaotic at low energies but increasingly regular at 
higher energies." |3]. Therefore, the problem of chaotic (deterministic) be- 
havior of the moderate and small prime numbers is still open. Moreover, there 
are also an indication of periodic patterns in the prime numbers distribution 
[3]- [7]. These patterns, however, has been observed in probability distribu- 
tion of the gaps between neighboring primes and not in the prime numbers 
sequence itself (see also Ref. [S]). The intrinsic randomness (stochastic or 
chaotic) of the primes distribution makes the problem of finding the peri- 
odic patterns in the prime numbers themselves (if they exist after all) a very 
difficult one. 

2 Logarithmic gaps 

It is believed that properties of the gaps between consecutive primes can 
provide a lot of information about the primes distribution in the natural 
sequence. The so-called prime number theorem states that the "average 
length" of the gap between a prime p and the next prime number is propor- 
tional (asymptotically) to Inp (see, for instance, Ref. |2]). This implies, in 
particular, statistical non-stationarity of the prime numbers sequence. That 
is a serious obstacle for practical applications of the statistical methods to 
this sequence. For relatively large prime numbers one can try to overcome 
this obstacle by using relatively short intervals |2] . Another (additional) way 
to overcome this obstacle is to use logarithmic gaps (logarithms of the gaps). 
The non-stationarity in the sequence of the logarithmic gaps is considerably 
'slower' than that in the original sequence of the gaps themselves. The log- 
arithmic gaps have also another crucial advantage. If the noise corrupting 
the gaps sequence has a multiplicative nature, then for the logarithmic gaps 
this noise will be transformed into an additive one. It is well known that 
the additive noises can be much readily separated from the signal then the 
multiplicative ones (see below). 

Before starting the analysis let us recall two rather trivial properties of 
the gaps, which will be used below. The sequence can be restored from the 
gaps by taking cumulative sum. Pure periodicity in a sequence corresponds 
to a constant value of the gaps (the period). 

Let us take, as a first step, logarithms of the gaps for the sequence of 
prime numbers. Then, let us compute cumulative sum of these logarithms as 
a second step. Then, we will multiply each value in the cumulative sum by 
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Figure 1: Autocorrelation of the v{n) functions for the In-sequence (circles) 
in the interval 9 x 10^ < n < 10^. The solid line correspond to autocorre- 
lation function computed for the telegraph signal generated by the Rossler 
attractor fluctuations overcoming the threshold x = 1. In order to make 
the autocorrelation functions comparable a rescaling has been made for the 
Rossler autocorrelation function. 

10 and will replace each of the obtained values by a natural number which 
is nearest to it. As a result we will obtain a sequence of natural numbers: 
7,14,28,35,49,55,.... which will be called as In-sequence. 

Let us now define a binary function v{n) of natural numbers n = 2, 3, 4, 
which takes two values +1 or -1 and changes its sign passing any number 
from the In-sequence. This function contains full information about the In- 
sequence numbers distribution in the sequence of natural numbers. 

Let us consider a relatively short interval: 9xl0^<n<10^. For this 
interval the autocorrelation function 

C(n, r) = {v{n)v{n + r)) - {v{n)) {v{n + r)) (2.1) 

computed for the f (n) function of the In-sequence will be approximately 
independent on n. Figure 1 shows the autocorrelation function (circles) 
computed for the v{n) function of the In-sequence for the 'short interval': 
9 X 10^ < n < 10^. Figure 2 shows an autocorrelation function (circles) com- 
puted for the v{n) function of the In-sequence for interval of much smaller 
natural numbers: 1000 < n < 2 x 10^ (here we rely on the much slower non- 
stationarity in the logarithmic gaps sequence in comparison with the original 
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Figure 2: As in Fig. 1 but for interval 1000 < n< 2 x 10^. 
gaps sequence). 

3 Models 

In order to understand origin of the oscillating autocorrelation function shown 
in Figs. 1 and 2 let us consider a very simple telegraph signal, which allows 
analytic calculation of its autocorrelation function. The telegraph signal 
takes two values +1 or -1 and it changes its sign at discrete moments: 

t„ = nT + C (3.1) 

where ( is an uniformly distributed over the interval [0, T] random variable, 
n = 1, 2, 3... and T is a fixed period. If g is a probability of a sign change 
at a current moment (0 < g < 1), then the autocorrelation function of such 
telegraph signal is: 

C{r) = {n- T/T){2q - 1)^-' + (r/T - (n - l)){2q - 1)" (3.2) 

in the interval (n — 1)T < r < nT. Figure 3 shows the autocorrelation 
function Eq. (3.2) calculated for q = 0.25, as an example (cf. with Figs. 
1 and 2). When parameter q approaches value 0.5 the magnitude of the 
correlation function oscillations decreases . At value q — 0.5 we have only 
linear decay in the interval < r < T and for r > T the correlation function 
C(r) = 0. 
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Figure 3: Autocorrelation function for the simple model telegraph signal: 
Eq. (3.2) with q = 0.25. 
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Figure 4: X-component fluctuations of a chaotic solution of the Rossler sys- 
tem Eq. (3.3) (a = 0.15, b = 0.20, c = 10.0). 
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In the model Eq. (3.1) ( was taken as a pure stochastic variable. This 
variable, however, can be also a chaotic (deterministic) one. Let us consider 
a chaotic solution of the Rossler system ^ 



dx 
'dt 



dy 

— = X + ay] 



dz 

— = b + xz — cz 
dt 



(3.3) 



where a, b and c are parameters (a reason for this system relevance will be 
given in Section 4). Figure 4 shows the x-component fluctuations of a chaotic 
solution of the Rossler system. 

Let us consider a telegraph signal generated by the variable x crossing 
certain threshold from below (Fig. 4). This telegraph signal takes values: 
+ 1 or -1, and changes its sign at the threshold-crossing points. Then, the 
fundamental period of the Rossler chaotic attractor provides the period T in 
Eq. (3.1), whereas the chaotic fluctuations of the variable x of the attractor 
provide chaotic variable ( in the Eq. (3.1) for the considered telegraph signal. 
Autocorrelation function computed for this telegraph signal has been shown 
in Figs. 1 and 2 as the solid line. In order to make the autocorrelation func- 
tions comparable a rescaling has been made for the Rossler telegraph signal's 
autocorrelation function. 

Now, let us restore the fundamental period Tq for the sequence of the first 
10000 prime numbers: 



where the period T for the In-sequence was taken from Figs. 1 and 2. 

4 Twin prime killing method 

One can expect that the smallest gaps between prime numbers are a source of 
the non-additive (high frequency) noise. Especially for small and moderate 
prime numbers where the minimal gap A = 2 occurs rather frequently. The 
primes separated by such gap are called twin primes. Let us kill one (say 
larger one) of the twin primes from each pair of the twin primes (a smart 
high frequency filter). Due to remaining part of the twins the subsequence 
of the primes preserves roughly the structure of the original full sequence of 
the primes (let us call this subsequence as K2-subsequence). 



To = exp(r/10) ~ 8 ± 1 



(3.4) 
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Figure 5: Autocorrelation of the v{n) function for the K2-subsequence (cir- 
cles) in the relatively short interval 6000 < n < 6600. 
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Figure 6: As in Fig. 5 but for interval 12000 <n< 13000. 
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Figure 7: As in Fig. 5 but for interval 19000 < n< 20000. 



On the other hand, for small and moderate prime numbers the K2- 
subsequence can be expected to be less corrupted by the non-additive high 
frequency noise than the original sequence of the primes. Indeed, figure 5 
shows autocorrelation of the v{n) function for the K2-subsequence (circles) 
in the relatively short interval 6000 < n < 6600. The dashed straight line 
indicates the characteristic linear decay (cf. with Figs. 1-3) and the arrows 
indicate the period T = 8 ± 1. Figures 6 and 7 shows analogous results 
for the intervals 12000 < n < 13000 and 19000 < n < 20000 respectively. 
Comparing this result with Eq. (3.4) we can see good agreement between 
the values of the hidden period of the prime number sequence obtained by 
the two different methods: logarithmic gaps and twin prime killing. 

It should be noted that the most probable value of the gap is A = 6 for 
the both original and K2-subsequence (one of the two integers nearest to any 
prime: p — 1 or p-|-l, always has 6 as its divisor). Although the hidden period 
does not coincide with the most probable value of the gap (and apparently 
it is a subject of the prime number theorem), the simple 6-divisor could be a 
starting point in an attempt to understand origin of the hidden periodicity 
(in particular, the end-point of the linear decay in the Figs. 5-7 is always 
equal to 6, cf. with Fig. 3). 
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